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Abstract. Szemeredi's regularity lemma is an important tool in graph theory which 
has applications throughout combinatorics. 

In this paper we prove an analogue of Szemeredi's regularity lemma in the context of 
abelian groups and use it to derive some results in additive number theory. 

One is a structure theorm for sets which are almost sum-free. If A C {!,..., N} has 
(57V^ triples (oi, 02, 03) for which oi + 02 = 03 then A ^ B U C, where B is sum-free 
and |C| = S'N, and J' ^ as J ^ 

Another answers a question of Bergelson, Host and Kra. If a, e > 0, if > No{a,e) 
and if A C {1, ... , N} has size aN, then there is some d ^ such that A contains at 
least (a^ — e)N three-term arithmetic progressions with common difference d. 



1. Introduction 

Szemeredi's regularity lemma fI^ is an important result in graph theory with numerous 
applications in combinatorics and number theory. It has been described as a structure 
theorem for an arbitrary graph. We give a very brief introduction to the regularity 
lemma which is designed to motivate the results of this paper. The reader may find a 
much more extensive survey in the excellent article of Komlos and Simonovits , and 
a nicely-explained proof of the lemma in 

Let r = (V, E) be a graph and let A, B be disjoint subsets of V . Define the density 
d{A, B) to be the proportion of elements {x,y) E A x B such that xy e E{V). If e > 0, 
we say that a pair [A, B) is e-uniform if 

\d{A',B') - d{A,B)\ <: e 

whenever A' <Z A and B' C B satisfy \A'\ ^ e\A\ and \B'\ ^ e\B\. 

Proposition 1.1 (Szemeredi's regularity lemma). Let e > 0. There exists M = M(e) 
such that the vertex set V{r) of any graph T can be partitioned into [1/e] ^ m ^ M 
sets Vi, . . . ,Vm with sizes differing by at most 1, such that at least (1 — e)m? of the pairs 
(Vi, Vj) are e-uniform. 

Hereafter we will refer to Szemeredi's regularity lemma as SzRL. One reason that SzRL 
has been described as a "structure theorem for all graphs" is the fact that it is possible 
to say much more about the bipartite graph induced by a regular pair {Vi, Vj) than it 
is about an arbitrary graph. As an example of this phenomenon we cite the following 
result. 

While this work was carried out, the author was supported by a fellowship of Trinity College, Cam- 
bridge, England and a PIMS postdoctoral fellowship at the University of British Columbia, Vancouver, 
Canada. 
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Proposition 1.2 (Counting lemma). Suppose that U,V and W are disjoint sets of 
s vertices in some graph T. Write d{U,V) = 7, d(y,W) = a and d{U,W) = (3, and 
suppose that the pairs {U, V) and (V, W) are e-regular. Write T{U, V, W) for the number 
of triangles in U xV xW {that is, triples {u, v, w) such that uv, vw, wu G E{T)). Then 

\T{U,V,W)-a(3^s'^\ ^ 4esl 

Observe that only two of the pairs {U,V), {V.W), {U,W) are required to be regular. 
Proposition 11.21 may be combined with SzRL to prove the following. 

Proposition 1.3. Let T he a graph on n vertices, and suppose that T contains o{n^) 
triangles. Then we may remove o{n^) edges from T so as to leave a graph which is 
triangle- free. 

The notation here is convenient but offers scope for confusion. What we mean is that 
there is a function 5' = S'{6) such that S' —>■ as 6 0, and which has the following 
property. If F contains at most Sn^ triangles then we may remove edges from F so 
as to leave a graph which is triangle- free. 

We have not attributed Proposition as it is not clear to us where it was first stated. 
A slightly weaker result was obtained by Ruzsa and Szemeredi in 1976 At that 
time SzRL had only been formulated for bipartite graphs. The modern j21] formulation, 
together with the ideas of p3^, would certainly imply Proposition II. HI The result is also 
well-known in the literature concerning "property testing": see, for example, pp. 

Proposition II. HI is surprising and interesting in its own right. It also has important 
applications, not the least of which is a simple proof of Roth's theorem that r^{n), 
the size of the largest subset of {1, . . . , n} containing no 3-term arithmetic progression, 
satisfies r^i^n) = o{n). 

One of the results of this paper is an analogous theorem for abelian groups. Let G be 
an abelian group with cardinality A^, and let A C G. A triple is a triangle 

ii X + y + z = 

Theorem 1.4. Suppose that A ^ G is a set with o{N'^) triangles. Then we may remove 
o{N) elements from A to leave a set which is triangle-free. 

In fact, we will deduce this result from the following more general theorem. 

Theorem 1.5. Let k ^ 3 be a fixed integer, and suppose that Ai, . . . , are subsets of 
G such that there are o{N''~^) solutions to the equation ai + ■ ■ • + = with ai G Ai 
for all i. Then we may remove o{N) elements from each Ai so as to leave sets A'^, such 
that there are no solutions to a'^ + ■ ■ ■ + a'^ = with a'^ G A- for all i. 

A simple corollary is the structure theorem for sets of integers which are almost sum-free, 
as featured in the abstract of the paper. 

Corollary 1.6. Suppose that A C [A^] is a set containing o{N'^) triples with x + y = z. 
Then A = B U C where B is sum-free and \C\ = o{N). 

The proof of Theorems ll.4l and ll.5l are in many ways analagous to the proof of Propo- 
sition 11.31 In particular we must prove a regularity lemma in the context of abelian 
groups. Although this regularity lemma (Theorem 15. 2|) is probably the most interesting 
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result in the paper, it takes some time to set up the notation necessary to state it and 
so we do not do so here. We will, however, give a sketch of how Proposition 11.31 follows 
from SzRL and the counting lemma fProposition II .2j) . The deduction of Theorem 11.41 
from Theorem 11.51 and an appropriate analogue of the counting lemma is in many ways 
quite similar. We start with a definition. 

Definition 1.7 (Reduced graph). Let F = {V,E) be a graph and let e > 0. Take a 
partition ViU ■ ■ ■ U Vm satisfying the conclusions of SzRL. Define a new graph F' as 
follows. For each pair consider whether one of the following three conditions is 

satisfied: 

(i) i = j; 

(ii) (Vi, Vj) is not e-regular; 

(iii) d{Vi, Vj) < 2eV3. 

If so, delete all edges from Vi to Vj. Let F' be the graph that remains; we refer to F' 
as an e-reduced subgraph of F. Often, when the parameters are clear from the context, 
we will refer to F' simply as the reduced graph. 

It is not hard to see that if F has vertices then 

\E{T')\ ^ \E{T)\-10e^/^N\ (1.1) 

Sketch proof of Proposition \l.S[ Define, for each 5 G (0, 1], a value e = e{6) for which 
4e(M(e))~^ > S, but so that e(5) — >• as 5 — *• 0. Suppose that F is a graph with 6n^ 
triangles. Consider F', an e-reduced subgraph of F relative to some underlying partition 
Vi U ■ ■ ■ U Vm coming from SzRL. We know from p. HI that F' is obtained from F by 
the deletion of relatively few edges. We claim that F' is triangle-free. If this is not the 
case then it contains a triangle ViVjVk with Vi & Vi, Vj E Vj, Vk E V^. For simplicity 
assume that \Vi\ = \ Vj\ = \ Vk\. Each of these sets has size at least ra/2M(e). Now by the 
construction of F' we see that (Vi, Vj) is e-regular and d{Vi, Vj) ^ 2e^'''^, and similarly 
for (Vj, Vk) and (\4, V)- Thus, by the counting lemma, we see that 

T{V,V„Vk) ^ 4e(M(e))^V > 6n', 

contrary to assumption. □ 

Our proof of Theorem ll.4l will follow a broadly similar scheme. There will be a regularity 
lemma, a counting lemma and, given a set A C G, a definition of an e-reduced subset 
A' of A. 

A somewhat different application of our regularity lemma is to a question raised by 
Bergelson, Host and Kra in |3] . The question was this: 

Question 1.8. Suppose that a,e > 0. Is it true that there is No{a,e) such that if 
> A'"o(a, e), and if A C {1, ... , A^} has size aN, then there is some d such that 
A has at least [a^ — e)N three-term arithmetic progressions with common difference d7 

In fact the question was also asked for 4-term progressions, and it was shown that no 
such result holds for progressions of length 5 and higher. We do not answer the question 
about 4-term progressions here (though see the remarks in ^ IV) . 

Ouestion 11.81 is addressed in ^ There we begin with the following result, whose proof 
is a relatively clean application of our regularity lemma. 
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Theorem 1.9. Suppose that a,e > 0. Then there is No{a,e) such that if G is an 
abelian group of size N > No^a, e) with N odd, and if A C G has size aN, then there is 
some d such that A has at least {a^ — e)N three-term arithmetic progressions with 
common difference d. 

This does not seem, as it stands, to give an affirmative answer to Question 11.81 By 
modifying the argument in some smaU but slightly technical ways, we can answer that 
question. 

Theorem 1.10. The answer to Question M.^ is yes. 

Let us conclude this introduction with a word or two on notation. Let G be a finite 
abelian group and let G* be the dual of G, thought of as the group of characters 
7 : G C. If / : G ^ M is a function and 7 G G* a character, define the Fourier 
transform /(7) = /(^)7(^)- Sometimes, when taking the Fourier transform of a 
reasonably complicated expression, we will use the alternative notation (expression)'^ (7). 
If f,g : G ^ M. define the convolution / * g(x) = J2y fiy)9i^ ~ v)- ^ number of 
simple instances of Young's inequality, such as the bounds ^ ||/||i||fl'||i and 

11/ * fi'lloo ^ ||/||oo||5'||i) "will be used without comment. 

2. A STUDY OF THE GROUP (Z/2Z)"' 

It is rather hard to describe the regularity lemma for a general group G (that will be 
the objective of sections El and Ej) • The group Z/2iVZ, which is of interest as regards, 
say. Corollary 11.61 has all the difficulties of the general case. It turns out, however, 
that everything works out very cleanly in the particular case G = (Z/2Z)". This is one 
more instance (cf. ^21 El 1221) which the consideration of vector spaces over finite 
fields facilitates thinking about questions concerning the integers. For a survey of this 
phenomenon, the reader may care to consult the survey article p."lj. 

This section is devoted to this special case, and is independent of the rest of the paper. 
What we describe is possibly the very simplest situation in which regularity and asso- 
ciated ideas such as the counting lemma can be studied. 

For the remainder of P set G = (Z/2Z)" and write = |G| = 2". Let if ^ G be a 
subgroup. For any g E G we may define a set A~^^ C if by setting 

A%'ix) = Aix + g) 

for X E H. These sets represent intersections of A with cosets of H. We will be 
interested in the Fourier coefficients of A^^ , defined for rj E H* by 

^'iv) = $^A+^(a:)(-l)<^'^>. 

If 

sup 1^(77)1 ^ e|ii| 

then we say that g is an e-regular value with respect to A (and the subgroup H). 

The concept of regularity in this sense has been well-studied as a notion of pseudoran- 
domness for subsets of abelian groups. See [71 for more details, other applications 
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and equivalent formulations. 

If the number oi g E G which fail to be e-regular is no more than eN then we say that 
the subgroup H is e-regular for A. In the statement of the following result, W{t) is 
defined to be a tower of twos of height \t~\ . 

Theorem 2.1 (Regularity lemma in (Z/2Z)"). Let e e (0, |) and let A C G. Then 
there is a subgroup H ^ G of index at most W{€~^) which is e-regular for A. 

Proof, li H ^ G, and if the sets A'^^ are as above, define a quantity ind(yl; H) by 

Observe that ^ ind(A; H) ^ 1 for any A and H. We will define a sequence of 
subgroups 

G — Hq Hi ^ H2 Hk 

of (very rapidly) increasing index. If Hi is not e-regular for A then we will use any 
abnormally large Fourier coefficients of the sets A'^^_ to construct an i^^+i for which 
ind(y4, i/j+i) is substantially larger than ind(74, Hj). This process must terminate after 
a finite number of steps (depending on e). 

The heart of the proof is the process of passing from Hj to ifj+i. This is covered in 
detail in the following lemma. 

Lemma 2.2. Let e G (0, |) and suppose that H ^ G is a subgroup which is not e-regular 
for A. Then there is a subgroup H' ^ H such that \G/H'\ ^ 21^/^' and md{A,H') ^ 
ind{A,H)+e^. 

Proof. There are at least eN values oi g E G for which sup^^Q \ A'^^{r])\ ^ €\H\. Now if 
gi and g2 lie in the same coset of H then the sets A'^^^ and A^^^ are just translates of 
one another. In particular it is easy to see that 

and so A^^^ and A'^^^ have large Fourier coefficients at the same points. This means that 
there are cosets H-\-gi, i — 1, . . . ,K ,WG / H\ ^ K e\G/H\ and points rji, . . . ,r]K E H* 

such that \Ay^{r)i)\ ^ e\H\ for all 51 e if g^. 

Let H' ^ H he the annihilator of the rji, that is to say the set of x such that {x, rji) = 
ioT all i — 1, . . . , K. The bound on \G/H'\ claimed in the lemma is immediate, and we 
must check that ind(74, H') ^ ind(A, H)-\-e^ as stated. As a first observation, note that 
^'(0) and all of the ^'(77^) are equal to \H'\. 
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Now one has 

N\H'\^\H\-iYid{A,H') = = 

g g€G heH 

2 

= EE 

geGheH 
gee h&H 

The sum over h E H may be written in terms of the Fourier transform on the subgroup 
H. One has 

N\H\'\Hfmd{A,H') = K'(r])|W'(^)P- 

The term with rj = is easily seen to be N\H\'^\H'\'^md{A, H). To bound the sum over 
1] ^ from below, write 

geG »?7^0 i=l geH+gi 

^ e^K\Hf\H'\' 
^ e^N\H\^\Hf. 

These observations confirm that ind(y4, H') ^ ind(A, H) + e'^, which is what we set out 
to prove. □ 

To deduce Theorem 12 .If simply carry out the program outlined just before the statement 
of Lemma 12.21 Set Hq = H, and define subgroups Hi inductively. If Hi is not e- 
regular for A, apply Lemma 12.21 with H = Hi, and set -ffj+i = H'. It is clear that 
ind(y4, Hi) ^ ie^, and so this process can take place no more than \_e~^\ times. In that 
time, the index \G/Hi\ has not become more than W{e'^). 

The reader who is familiar with the proof of SzRL will notice some strong similarities 
between that argument and the proof of Theorem 12.11 

The next result is a counting lemma directly analagous to Proposition 11.21 

Proposition 2.3 (Counting lemma in (Z/2Z)".). Suppose that H is a subgroup of 
G, and let gi, g2, 93 £ G. Suppose that \A^^'\ = ai\H\, and suppose that A'^^^ is e- 
regular. Then T{gi, g2, g3) , the number of triples {xi,X2,X3) such that Xi G A^^' and 
Xi + X2 + Xs = 0, satisfies 

\T{gug2,g3) - aia2a3\H\^\ ^ e\H\^. 

Proof. One has, using orthogonality relations for characters. 



ngi,g2,93) = E At,'^{xMT{^2)At,''{xs) 

+2:2 +2:3=0 
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The term with 7 = is precisely 01020:3 One can estimate the remainder with a 
simple P'-l^ inequality. Indeed 



7^0 



1/2 



1/2 



^ sup 

77^0 



the latter step following from the e-regularity of and Parseval's identity. The result 
follows. □ 



Now we define a notion of reduced set, corresponding to the definition of reduced graph 
(Definition [TTj). 

Definition 2.4 (Reduced set). Suppose that A C G, and let H be e-regular for A. 
Define a new set A as follows. For each g & A, consider whether either of the following 
two conditions is satisfied: 

(i) A~^^ is not e-regular; 

(ii) ^ i2ey/'\H\. 

If so, delete all of A~^^. Let A' be the set that remains. 

Observe that (ii) depends only on the coset of H that g lies in. Write X for the set of 
all g satisfying (ii). Let S <^ G/H he the set of cosets met by X, and for each s & S 
select some gs G S H X. Since X = S H A we have 

|X| ^ J2\^h''\ ^ (2e)'^^|5||i/| ^ {2ey/'^\G/H\\H\ = (2e)i/3^. 

ses 

The number of g satisfying (i) is at most eA^, and so 

\A'\ ^ \A\ -3e^/^N. (2.1) 

We are now in a position to prove Theorem 11.41 for G. Recall that a triangle in a set A 
is a triple (x, y, z) G A^ with x + y + z = 

Theorem 2.5. Suppose that A C (Z/2Z)" contains o{N'^) triangles. Then we may 
remove o{N) elements from A to leave a set which is triangle- free. 

Proof. Define, for each S G (0,1], a value e = e{S) for which e(A(e))^^/^ > 5, but so 
that e{6) as 5 — > 0. Suppose that A C G is a set with triangles. Consider A', 
an e-reduced subset of A relative to some subgroup H ^ G of index at most A(e) and 
which is e-regular for A. The existence of such an H is the content of Theorem 12.11 We 
know from 1)2.11) that A' is obtained from A by the deletion of at most 3e^/'^A^ elements. 
We claim that A' is triangle-free. Suppose that it contains three elements yi, ?/2, ys with 
yi + y2 + Vs = 0. But every triple (xi, 0:2, Xs) G with Xi G A'^^" and Xi + X2 + Xs = 
gives rise to a triangle {xi + yi, X2 + 2/2, xs + y^) in A. By Proposition 12. HI and the fact 
that the Xj satisfy neither condition (i) nor (ii) in Definition 12.41 the number of such 
triples is at least e|ifp, which is more than SN"^. This is contrary to assumption. □ 

In ^101 we will give an example in the spirit of Gowers 110] which shows that the huge 
bound which occurs in Theorem 12.11 is to some extent necessary. For now, however, we 
press on with the main aim of the paper, which is the generalisation of the above to an 
arbitrary finite abelian group. 
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3. The treatment of arbitrary abelian groups - introduction 

There is one rather obvious obstacle to generahsing the results of ^to a general abelian 
group G: the lack, in general, of a plentiful supply of subspaces. In place of them, we 
will use Bohr neighbourhoods. 

For the purposes of this paper, we will define the argument arg 2 of a complex number 
z to lie in the interval (— vr, vr]. Let F = {71, . . . , 7^} be a set of characters on G. Define 
the Bohr neighbourhood i?r,<5 by 

Bt,5 = '■ \ arg7j(a;)| ^ 2-n5 for all j = 1, . . . , d} . 

It is convenient to write ||x||r = sup^- 1 2^ arg 7^ (x) | , so that By, 5 is simply the set {x : 
\\x\\t ^ S}. When G = (Z/2Z)", Bohr neighbourhoods are just subgroups. For other 
groups this is not the case, and in general there are some fairly substantial differences 
between their behaviour and that of a true subgroup. In Z/A^Z the Bohr neighbourhood 
B = By s tends to resemble a dimensional convex body, so that typically \B + B\ will 
be of cardinality closer to 2*^151 than to \B\. Thus B fails to behave like a group to 
a substantial extent, which (it turns out) means that there is no sensible way to do 
harmonic analysis on B, even approximately, in the manner we described in ^ 

The method we use to get around this stems from a beautiful observation of Bourgain 
jS]. li B' = -Br,5'; where 6' -C 6, then B' tends to resemble a scaled-down version of B. 
It might then be expected (perhaps by thinking geometrically, imagining B and B' to 
be convex bodies) that \B-\-B'\ ^ \B\. Roughly speaking pairs of Bohr neighbourhoods, 
one much smaller than the other, are an appropriate substitute for subspaces. We will 
see in later sections that it is possible to do a sort of approximate harmonic analysis on 
such pairs of Bohr sets. 

4. Properties of smoothed Bohr neighbourhoods 

In this section we define what may be called smoothed Bohr neighbourhoods and es- 
tablish the basic properties of these functions that we will need. These are needed 
because there was one respect in which the discussion of ^was too simplistic. It turns 
out that -Br,(i-K)5 and B^ s can be quite different, even for very small k. For a simple 
example, take G = (Z/SZ)" and S = 27r/5. This kind of behaviour means that Bohr 
neighbourhoods do not always behave in a similar manner to convex bodies. 

Bourgain circumvented this obstacle by showing that for a fixed F, most values of 6 are 
such that -Br,<5 behaves in what he calls a regular fashion. This makes the details of 
the argument even more difficult. In an exposition of Bourgain's work, Tao [23] effects 
a significant simplification by putting this averaging over S into the definition, getting 
a kind of smoothed Bohr neighbourhood. We give a different construction which is 
nonetheless inspired by this idea of Tao. 

This is a technical section of the paper the reader will lose little by simply looking at 
the definition of the functions ipr,s (Definition I4.3|l and very briefly checking out the 
statements of their properties as laid down in Lemma 14.41 We begin with some simple 
properties of (unsmoothed) Bohr neighbourhoods. 
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Lemma 4.1. Let G be an abelian group of size N, let T = {71, . . . ,7^} be a set of d 
characters on G and let 6 > 0. Then 

(i) \Br,s\ > S'N. 

(ii) \Br,2s\ ^5'\Br,5\- 

Proof. If t G M/Z write |t| for that representative of t (mod 1) which hes in the interval 
(—1/2, 1/2]. For any rj, write 5*^ for the set of all for the set of all y = (j/i, . . . ,yd) G 
W^/Z'^ for which \yj\ ^77/2 for all j = 1, . . . , d. Now if x E G write 

v{x) = (;^arg7i(x),...,^arg7d(x)) G R'^/Z'^. 

If v{x) and v{x') both lie in some translate a + Ss then x — x' E Br^s, and so for fixed 
x' G v{G) n (a + Ss) the map x ^-^ x — x' defines an injection from v{G) fl (a + Ss) to 
i?r,5- Hence 

\v{G)n{a + Ss)\ < \Br,s\. (4.1) 
Proof of (i). By a simple averaging there is some translate a + Ss such that 

\v{G)n{a + Ss)\ ^ \Ss\\v{G)\ = \Ss\ = 5''|G|. 

The result is now immediate from ()4.1|) . 

Proof of (ii). From ()4.1|) one has 

\v{Br,2s)n{a + Ss)\ ^ \v{G) n {a + Ss)\ ^ \Br,s\. 
Now f (-Br,2(5) n (a + Ss) is empty unless a G 5*55, and so 

|5r,25| = [ \v{Br,25)n{a + Ss)\da ^ l^r,^! ■ ^ = S'^-I^r,^!. □ 
\^s\ Jm.d/^d \bs\ 

Lemma 4.2. Let G be an abelian group of size N, let T = {71, . . . ,7^} be a set of d 
characters on G and let 6 > 0. Define the smoothed Bohr neighbourhood B^^s by setting 
Br,six) = Br,tix)^-^ dt and define (3r,5{x) = Br,s{x)/\\Br,5\\i- Then 

(i) = 1; 

(ii) \\f3r,s\\oo<S/6^N; 

{iii) For all x,y e G, \Pr,s{x) - Pr,5{x - y)\ ^ 5 sinh (^^^^j /3r,5(a;). 
(iv) For all 7]>0, ^ f3r,s{x) ^ 2 ■ 5'^e~'^^^^ . 

Proof, (i) is trivial. Before proving (ii), note that for all x G G one has Br,s{x) ^ 
eBrs{x). Indeed if x G -Br5 then Brt{x) = 1 for all t ^ 6, and so 



Br,s{x) ^ J —^dt = 1/e. 



Part (ii) of the lemma is an immediate consequence of this observation and Lemma 14.11 
(i). To prove (iii), it is easiest to prove the corresponding statement for the unnormalised 
functions -Br,5- Write rj = \\y\\r- Then y G -Br.r?- Suppose that Br,t{x) — Br,t{x — y) 7^ 0. 
Then either x G B-p^t and x—y^ B^^t, which means that x G -Br,A-^r,i-r;; or else x ^ i?r,t 
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and X — y E Bp t, in which case x G Br^t+rj \ -Br,t- Thus certainly x G Br^t+t] \ -Br,*-??- 
We have, then, 

Br, 5 (x) - Br, six - y) ^ J \Br,tix) - Br,tix - y) \ dt 

roc -t/5 

^ {Br ) - Br,t~-n{x)) dt 

5r,„(x)^-rf?i + 2sinh(^jj y Br,u{x)—^du. 

Now if ^ 5 then e"^'''^ ^ sinh(?7/5), and the result is immediate. If r/ ^ 5 then we 
instead use the estimate 

5r,.(x)^rfn ^ -y 5r,„(x)rfM ^ -^Br,5{x) ^ -jBr,s{x). 

Since x ^ sinhx for a; ^ 1, part (iii) of the lemma follows. Finally we prove (iv) by 
using Lemma l4.ll (ii), working once again with the unnormalised functions Br.s- One 
has 

-t/5 



\\^\\r>ri 



\Br,t\^dt 





\Br,2u\ — ? — du 





r?/2 

^ 2-5M \Br,u\^^du 



< 2-5V''/'^ / \Brf—du 



r,/2 

''^ / |5r,«| — 
J'n/2 

^ 2-5V^/2^||fir,5||i. 
This concludes the proof of (iv) and hence of Lemma 14.21 □ 

We are now ready for an important definition. 

Definition 4.3. Let G be an abelian group, let F = {71, . . . , 7^} be a set of d characters 
on G and let 5 > 0. Define 

^r,<5 = /3r,5 * /3r,5, 

where j3r,5 is the normalised and smoothed Bohr neighbourhood defined in the statement 
of Lemma 14.21 

The following is a very long and rather disparate collection of properties enjoyed by the 
functions '4't,&i ah of which will be required later on. 

Lemma 4.4. Let 5,5' > Q and suppose that F, F' are two sets of characters with |F| = d, 
|F| = d' and F C F'. Let x,y be elements of G, let f : G ^ M. be a function with 
II /II 00 ^ 1 and let r G (0, 1/4). Consider the functions ipr,s and il>r',S' as defined above. 

(i) ipr 6 has real and positive Fourier transform. 

(ii) ll^r,5||i = 1. 

(iii) ||^r,5||oo ^ S/5''N. 
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(iv) Suppose that 6 ^ 2~^^r^/(i and that 7 G F. Then ||(7 — l)^/'r,(5||i ^ t, o-nd conse- 
quently ipr,s{l) ^ 1 — T. 

(v) l^prA^) - i^rA^ - y) I ^ 5 sinh (^^) iprA^) ■ 
For parts (vi) - (viii), assume that 6' ^ 2~^^6t'^ /d' . 

(vi) Let m be a positive integer. Then for any x E G we have 

where the notation indicates that there are m copies of iI)y',8' in- the convolution. 

(vii) llV'r,^ * V'r',^' - V'r.^lli ^ r. 

(viii) \\{f^r,s) * - ^rj(/ * ^r',5')ll2 ^ r. 

(ix) Let K,,uj > 0, and suppose that E G* is such that 'ipr,s{'y) ^ i^- Suppose that 
6' ^ uj"^ n"^ 5 / 2^^ d' . Then ||(7 — l)?/'r',5'||i ^ ^- -^'^ particular, ipT',&'{l) ^ 1 — 00. 

Proof, (i) and (ii) are immediate, and (iii) is an easy consequence of (ii) and Lemma 
14.21 part (ii). To proceed further, we need to estimate the tails of ipr,5- Let rj > he 
arbitrary. We have 

^r,5{x) = ^/3r,5(y) Yl (^^A^-y) 

lkllr^»7 y WAW'^v 

^ 5Z (^^Ay)+ sup Y f^r,5{x-y). 

Now if ||x||r ^ 1] and \\y\\r ^ ^7/2 then ||a; — y\\r ^ t]/2. Therefore 

Y ^tA^) ^ 2 Y My) ^ 4■5V^/^^ (4.2) 

this last step following from Lemma [4.21 fiv). Equation ()4.2|) is one that will be of much 
service in the sequel. To prove (iv), set rj = 166d + 64(5r~^ and note that the condition 
6 ^ 2~^'^t'^ /d implies that rj ^ r/16. Therefore 

^ |1 - 7(x)|V'r,5(a;) ^ 2 ^ V'r,5(a;) + sup |l-7(x)| 

^ 8 ■ 5 V^"/^-^ + 8r7 
^ r/2 + r/2 ^ r. 

To prove (v) write rj = \\y\\r, so that y G -Br,??- Lemma Hill (hi) teUs us that \Pr,s{x) — 
Pr,5{x — y)\ ^ 5smh.{ri/6)Pr,s{x). Thus 

\i^r,s{x) - iJr,s{x - y)\ ^ ^/?r,^(^) \Pr,s{x - z) - (3rA^ - y - z)\ 

Z 

^ 5sinh(r7/5) ^/5r,5(2;)/5r,5(x - z) 

z 

= 5smh{ri/6)ipr,s{x), 

which is exactly (v). An immediate consequence of (v) together with the inequality 

1^1/2 _ ^1/2 1 ^ a-i/2|g^ _ 5| ig ti^g bound 

4^6 i^) - - y)\ ^ 5 sinh [Ml^j ^V2(^)^ (4.3) 
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which will be of some use later on in the proof of the lemma. Now recall that parts 
(vi) - (vii) of the lemma are to be proved under the assumption that 5' ^ 2~^^ dr"^ / d' . 



We begin by estimating the sum 5 sinh (^■'^^j ipr' ,5' (y) , which arises in applications 

of (v) and equation ()4.3|) above. Let r] = I606'd'/T, and split the sum into the ranges 
\\y\\r ^ V and ||y||r ^ V- The sum over the first range is trivially bounded by sinh.{ri/6). 
To bound the sum over the second range, observe that 



sinh 



\y\\r 

6 



sinh 



cosh 



^ ^r',5'iy)dt. 



Now since F C F' the set {y : \\y\\r ^ t} is a subset of {y : ||y||r' ^ t}. Moreover, since 
cosh a; ^ e^, we can use (I4.2|l to bound 



i j coshQ^ ipY',6'{y)dt 



4 ■ S"'' 
6 



16 ■ 5'^ 5' 



-V/SS' 



This, it can be checked, is at most r/10. It remains to observe that 2smh.(ri/6) ^ 
4:rj/6 ^ r/10. Adding everything together gives the bound 



^5 sinh 



ipT^S'iy) ^ r. 



Equations ()4.H|1 and ()4.4j) immediately imply that 



(4.4) 



(4.5) 



which easily implies part (vi) of the lemma in the case m = 1. To prove the result for all 
m we proceed by induction. Supposing the result to have been proved from m = I — 1, 
we have the inequalities 



l/2x 



,1/2 



1/2 



^ r(2'-i - * ^r',5'{x) + tt^^^^ (x) 



1/2 



1/2/ 



r2'-Vr('(^) + r(2'-^ - 1) {^'^[s * ^r',s' - ^r[s)i^) 



i«-i 



,1/2 



,V2n 



^ (2' - l)r^^/,^(x). 

Thus the result is true for m = / as well, which confirms (vi). Part (vii) of the lemma 
is an immediate consequence of (v) and equation ()4.4j) . Moving on to part (viii), an 
immediate consequence of ()4.5|1 is that 



which implies the required result. Finally, we prove statement (ix). For any fixed y we 
have, by (v). 



,1/2 



1/2, 



K\l-^{y)\ ^ |V;r,5(7)||l-7(2/)l 



{^r,5{x + y)- ipr,s{x)) 7(x) 



^ 5 sinh 



\y\\r 
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The result therefore holds if ()4.4|) is true with r = nuj, which it is if 5' ^ K^iJ^b jl^'^d! 
by a calculation almost identical to the one we did earlier to establish ()4.4j) . □ 

5. The regularity lemma 

Let i? C G* be a set of d characters, and let r/ G (0, 1). Given R and r/, we will always 
write = ipR,ni, "^2 = i'R,r]2 where rji = t] and ri2 = 2~'^^e^rj/dk'^. The functions iIjy^s are 
those defined in ^ whose properties were laid out in Lemma f4. 41 Recall that in ^we 
outlined the need for a pair of Bohr neighbourhoods B and B' , with B' much smaller 
than B. This pair of functions ipi and il)2 turn out to be the correct way to realise this 
idea. Observe that, by Lemma f4.4l fvii). we have 

11^1 *V'2 -^1 111 ^ 2-12^-2^1 ^5_^) 

This is the most useful way of quantifying the need that |i? + -B'| ~ |i?|. We will be 
working with a set called A, and also with sets named Ai, . . . , A^. Write af = A*ipi{x) 
and af^ = Ai * ipi^x), and similarly for ip2', it is perhaps unusual to use superscript 
notation for a function of x like this, but it is a useful way of making some of our later 
formulae more readable. We will also write A'^^{n) = A{x + n), so that A'^^ is the 
characteristic function of A translated by x, and write Af^{n) = Ai{x + n). 

When we write ipi and ip2 we will always assume that they come from some underlying 
set R and parameter t] in the manner just described. We will always write |i?| = d. 

Definition 5.1 (Regularity). Let e > and let x G G. We say that x is an e-regular 
value with respect to the set A ( and the pair (i?, 77) ) if the following is true. 

(i) E,(«r'-«f)'v^i(z/) ^ 

(ii) ||((A+--af)^2ri|oo ^ 6. 

We say that (i?, rf) is e-regular for the set A if the number of x G G which are not 
e-regular is less than eA^. 

We are now in a position to state our regularity lemma. Little extra work is involved in 
proving a version which holds for k sets Ai, . . . , simultaneously instead of one, and 
we do this in order that we may prove Theorem 11.51 Write W{t) for a tower of twos of 
height \t\. 

Theorem 5.2 (Regularity lemma for abelian groups). Let k he a positive integer, and let 
e > 0. Let Ai, . . . , Afc he suhsets of G. Then there is a pair {R, rj) with d ^ W {2^^ k"^ e^^) 
and f] ^ 1/W{2^^k'^e'^) which is e-regular for each Ai. 

Define the zth index of {R, rj), the index with respect to Ai, by 

ind,(i?,r/) = N-'\\a,,i\\l = X^Ki)'- (5-2) 

X 

Define also the (total) index 

k 

md{R,r]) = ^ indj(i?, 77). 

1=1 

Observe that ind(i?, rj) ^ k. The main result of this section is the following. 
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Proposition 5.3. Suppose that {R, rj) is not e-regular for all of Ai, . . . , A/^. Then there 
is a pair {R,rj) with \R\ ^ {2dk / rjeY'^'^ and rj ^ (r]e/2dky^'^ such that md{R,rf) ^ 
md{R,ri) + 2-^°k-h\ 

Once this is proved, it is a sliort step to Theorem 15.21 Start with the trivial pair 
{R, Tj) = (0, 1). If this is not e-regular for all of Ai, . . . , A^ then apply Proposition 15 .31 to 
get a new pair {R, rj) . If this is not e-regular then apply Proposition 15.31 again, and so 
on. The index increases by at least 2~^^k~^e^ at each iteration, and so the total number 
of steps cannot exceed 2^°/c^e~^. When the algorithm finishes we have a regular pair 
(i?, rj) and it is not hard to see that \R\ and rj satisfy the claimed bounds; one can afford 
to be incredibly crude when examining the growth of and the decay of everything 
other than the number of iterations being essentially irrelevant. 

Let us begin to address Proposition 15.31 Suppose that (-R, r/) fails to be e-regular for all 
of Ai, . . . , Ak- Then there is some i together with at least tN/k values of x which fail to 
be e-regular with respect to Ai. For the rest of the section write A = Ai] when we talk 
about values or pairs being regular, it will always be with respect to this underlying set 
A. 

Now there are two ways in which {R, rj) could fail to be e-regular: either clause (i) of 
Definition 15. II fails for at least eN/2k values of x, or else clause (ii) does. We shall deal 
with these two possibilities separately in Propositions 15.61 and 15. lO] which together give 
Proposition 15 .31 immediatelv. We begin with a technical lemma which will be used three 
times in the sequel. 

Lemma 5.4. Suppose that 0i, 02 (iiT'd f are functions from G toM. such that * 02 ~ 
0i||i = K and ll/lloo ^ 1- Write fi = f * (pi- Then 

ll/2il^-||/i||^ > EE(/2(^ + 2/)-/i(^))Vi(l/)-8«:iV. 

X y 

Proof. It is straightforward to check the identity 

$^(/2(x + 2/)-(/2*0i)(x)f 0i(l/) = /2'*0i(x)-(/2*0i(a;))^ (5.3) 
y 

Write E{x) = (/2 * 0i — fi){x). We have 

\E{x)\ = ||/*02*01-/*0l||oo ^ ||/||oo||02*01 -0l||l ^ 1^- 

Substituting fi = f2*4>i + Em two places in ()5.3|1 . conducting some simple manipula- 
tions, and summing over x proves the lemma. □ 

Corollary 5.5. For any j G {1, . . . ,k} we have indj(i?, 772) ^ mdj{R,ri) — 2^^k^'^e^. 

Proof. Apply lemma EIU with 0i = "01, 02 = "02 and f = Aj. By ()5.H) the hypotheses 
of the lemma apply with n = 2^^'^k^'^e^. Thus 

. wp , . wp , ll/2|ini/l||^ ^ E.E, ja^,^' - a^,Y My) _9,__2 3 

mdj{R,ri2) -mdj{R,r]) = — ^ ^ — 2 k e, 

which is at least — 2^^/c~^e^. □ 

Proposition 5.6. Suppose that there are at least eN/2k values of x for which ^y{oL2^^ ~ 
alfi)i{y) > e^- Then md{R,r]2) ^ md{R,r]) + /8k. 
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Proof. Apply Lemma 15.41 with ipi = (pi, ip2 = 02 and f = A = Ai. The hypotheses of 
that lemma hold with k = e^/32k. One therefore has 

mdi{R,r]2)-mdi{R,ri) = — ^ ^ — e /Ak ^ e /4/c. 

It follows from this and Corollary 15.51 that 

k 

ind(i?, r]2) - ind(i?, rj) = ^ (indj(i?, 772) - indj(i?, i])) 

i=i 

^ + 5Z ^2) - ind,(i?, r^)) , 

which is at least e^/8k. □ 

We now begin working towards Proposition 15. 10( which deals with the possibility that 
there are at least eN/2k values of x for which || ((^4+^ — a|)'?/'2)^ ||oo > e- The next 
few lemmas are all relevant to this endeavour. Before formulating them, let us define 
the (approximate) orthogonal complement of 1^2, "4^2 •> be the set of all 7 for which 

^^2(7) > e/6- 

Lemma 5.7. Suppose that 7 ^ ^z^^, and that \{{A'^^ — af)^/'2)^ (7)| ^ Then for all 
y e 5fl,,^2/6o + x we have - al)ip2)^ ^ e/2. 

Proof. Since 7 G ip2y one knows that \{A'^^ilj2)^{'y)\ ^ 5e/6. Now suppose that m G 
BR,ey 60- We have 

(v4+(^+™V2)''(7) = 7(m)^v4(x + n)^/'2(n + m)7(n). 

n 

But 

^ A(x + n)?/'2(ra + m)7(r2) - (^+^^2)^(7) = v4(x + n)-f{n) {i)2{n + m) - ^2(n)) 

n n 

and so 

|(A+(^+-)^2)^(7)-7M(A+>2)^(7)| ^ ^|^2(ri + m)-^2H| 

n 

^ 5sinh(e/60) ^ e/6, 

the last step being a consequence of Lemma 14.41 (v). Therefore \{A~^^^~^^^ ^2)^(1)1 ^ 
2e/3, and so finally 

-a^+")^2)''(7) > e/2. □ 

Lemma 5.8. Lei U C G be a set and let n > 0. Then there is K ^ {2/nY, disjoint 
sets 5*1, ... , Sk ^ U and points zi, . . . , zk G U, such that U^i ^ 1^1/2 CL^d Si C 
+ /or eac/i i. 

Proof. Set A = i?H,K/2- We define S*!, 5*2, . . . and zi,Z2, ■ ■ ■ inductively. Suppose we have 
defined Si, ... , Sj, and write Uj = f/\IJi=i ^i- If l^^jl ^ 1^1/2 then stop; at such a point 
one does indeed have | IJi=i ^ 1^1/2- Otherwise, a simple averaging argument shows 
that there is z such that |f/,-n(A+z)| ^ \Uj\\A\/N ^ |f/j-| |A|/2A^. Set 5*^+1 = Ujn{A+z), 
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and let Zj+i be any element of Sj+i. It is clear that 5^+1 C 5^^^ + ^j+i- Now at each 
step of this iteration the size of U is depleted by at least |t/||A|/2iV. The maximum 
possible number of steps is thus no more than A^/|A| which, by Lemma l4.ll (i), is at 
most {2/kY. □ 

Lemma 5.9. Suppose that \\ {{Ax — 02)1^2)^ ||oo > e for at least eN/2k values of x. 
Then there is a pair {R,?}) with \R\ ^ {2dk / rje)^^'^ , rj ^ {2dk / rje)'^^'^ and so that the 
associated function "^Ai = V'_r ^ satisfies 

(i) \\iP2*i^i-H\i ^ 2-"rt3; 

(ii) There is a function 6 : G ^ G* and a set X C G with cardinality at least eN/8k 
such that for all x ^ X we have 

\\{ex-l)A\\i ^ e/8 and 

Proof. Let Z be the set of all x for which || {{A^ — a2)ip2)^ ||oo > e and let lu : Z ^ G* 
be a function such that 

for dl\ X E Z. If there are at least \Z\/2 values oi x E Z for which ujx € ^/'^ then 
let X be the set of such values, lei R = R, rj = 2~^^t^'^ri/ d^k^ and let 9 = uo. It is a 
straightforward matter to check, using Lemma (4.41 fvii) and (ix), that both conditions 
(i) and (ii) are satisfied. 

Alternatively, suppose that there are at least \Z\/2 values oi x E Z such that ^ '1^2 ■ 
Let U be the set of such points. Apply Lemma 15.81 with k = e?72/60. This gives 
sets Si, ... , Sk ^ U and points zi, . . . ,zk G U where K, it can be checked, satisfies 
K ^ {2dk/r]ef^<^. Write Vt = {u},„...,u},J and let X = [jS,. Then |X| ^ eN/8k. 
Furthermore if x G X then there is some i such that x G -BH,ej72/60 + ^i- By Lemma f5.8[ 
this means that 

|((A.-a^)z^2)^^.J| > 6/2. 

Now let R = RU Q and set rj = 2~^^e^ri2/ dk^, where d = \R\. Property (i) is a conse- 
quence of Lemma 14.41 fvii). and (ii) follows from Lemma 14.41 fiv). The demonstration 
of Lemma 15.91 is concluded by a slightly tedious computation, which is necessary to 
confirm that \R\ and rj satisfy the stated bounds. □ 

Proposition 5.10. Suppose that \\ {{A^ — af)^/'2)^ ||oo > e for at least eN/2k values of 
X. Then there is a pair {R,rj) with \R\ ^ {2dk / rje)^^'^ , rj ^ {2dk/rie)~^^'^ and ind{R,rj) ^ 
md{R,T]) + 2'^'^k-h\ 

Proof. Let {R, rj) be the pair constructed in Lemma [5.91 and let X and 9 be the objects 
associated with it so that conditions (i) and (ii) of that lemma are satisfied. Thus |X| ^ 

eN/8 and, for all x G X, one has \ {{Ax - af)^/'2)^ {9x)\ ^ e/2 and \\{9x - l)^i||i ^ e/8. 
We will show that if x G X then 

Y.ial-^'^-alfUy) > e'/Wk. (5.4) 
y 



{{A^^ - a^2)^2T {Ox) ^ e/2 
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Once this is shown, an apphcation of Lemma 15.41 with 01 = ^/'2, </>2 = V^i, / = ^ = 
and K = 2~^^k~'^e^ gives 

. , ll/2||i-||/l||i ^ E.Eyj^r'-Cy^YMy) ^_8,_i3 

mdi{R,r]) -mdi{R,r]2) = — ^ — 2 k e, 

which is at least 2~^k~^e^. It follows from Corollary 15.51 that indeed mdi{R,rf) ^ 
indj(-R, T]) + 2~^k~^e^ and then, by another application of Corollary 15. 51 that ind(-R, rf) ^ 
ind(i?,r/) + 2-i°A;-ie^ 

It remains, of course, to proye ()5.4p . Suppose, throughout what follows, that x G X 
and write g{n) = A'^^{n) — ^2 a^^d f{n) = g{n)6x{n). Thus 

n 

We haye, then, 

\f*^i*MO)\ > \f *MO)\-\\f * 0^2*^1 -i^2)\\oc> > 3e/8. (5.5) 



Recalling that 6^ is a character, so that 9x{n — m) = 6x(n)9x{m), one has 

/*V'i*^2(0) = gO^ * iii * i/j2i0) 
= 9* 0^i)i * 61^^2(0) 

= ^?*t^i*^.^2(0)+ (^7*^(^-1)) *M2(0), (5.6) 

which we may write as Ei + E2. Now ||6'^^/'2||i ^ ||V^2||i = 1, and so 

1^2! ^ lk*^^i(^-l)||oo ^ ||(?||oo||(^^x-l)^i||i < e/8. 
Together with (j5.5|l and ()5.6|1 . this implies that \Ei\ ^ e/4. But 

\E,\ ^ 5^|(A+--a^)*V^i(y)|^2(2/) = $^|5r^-«^1^2(2/) 

y y 

^ ( - "2)^2(2/)) ( Uy)) = ( - ^2fUy)) ■ 

y y y 

This confirms ()5.4|1 . and hence the proposition. □ 

Combining Propositions 15 . 61 and 15 . 1 Ol concludes the proof of Proposition 15.31 and hence, 
by the comments following the statement of the proposition, of Theorem 15.21 □ 

6. The Counting Lemma 
For any k functions /i, . . . , : G ^ M write 

T{fi,...Jk) = f ■■■fk{xk). 

xiH hxk=0 

Obserye that T is a multilinear operator and that 

T{fu...Jk) = [ /i(7).../fc(7)c?7, 
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where the integral is taken with respect to the normahsed counting measure on G* (so 
that it equals the sum over all 7, divided by N). Throughout this section we will assume 
that Ai, . . . , Ak are subsets of G and that ipi and '02 come from a pair (i?, rf) which is 
e-regular for the Ai. The following lemma, a generalisation of a lemma in [2^], will be 
used several times later on. 



Lemma 6.1. Let f : G 



be a function with 



^ 1. Then 



T (0}/', ^2, . . . , ^2, M'/') - Yl 



X] 



Proof. From Lemma H^ (vi) we know that | {^\^'^ * ip2* — ? *'V^2 — V^^^^ ^ 2^eip'^/'^{x) 



Thus 



= (0}^^ *02* *^2 - ^}^^] (2;) 



Proposition 6.2 (Counting Lemma). Suppose that Ai, . . . , C and i/iai xi, . . . , 
are e-regular values with xi + . ■ ■ + = 0. Then 



T {Ar^i^\'\ A+-^2, . . . , ^rr V2, ^r'^^!^') - «r:i«2>3f2 • • • <4-i>ti 



^ 4.2\. 



Proof. The multilinearity of T implies that for any 2k functions /i, /fc, /{,..., we 
have 

TWi, /2, . . . , /fc-l, /fc) = r(/l, /2, . . . , //c-l, //c) + r(/i, /2, . . . , /fe — /fe) 

+T{f[ — fi, /2, . . . , /fc) 

k-l 

+ 7'(/(, ■ . ■ , //-I, /i - /i, . . . , /fc-1, fk), (6.1) 

i=2 

which we write as P1 + P2 + P3 + E!=2 Qi- We will apply with /( = a'[\ipl^'^, // = 
(2 = 2, . . . , A; - 1), = a^^l0y^ A = A+^^V^}/', /. = A+^'V^2 {^ = 2, . . . , k - 1) 
and /fc = A'^^'^ipl^'^ . Each of the terms -Pi, -P2, -P3 and will be estimated separately. 

Estimation of Pi. Using the multilinearity of T and Lemma [6.11 with / = 1, we have 

I Pi — «i,\«2,2'^3,2 • • • '^fc-l,2'^fc|l| ^ 2'^^- 

Estimation of P2. Again we use Lemma ffi. If this time with / = A'l^'' — a^\. We have 
IP2I ^ 2''e. 
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Estimation of P'^. By the Cauchy-Schwarz inequality we have 

lAl = lU[-h)\l)f2{l)...fk-l{l)f',{l)d^ 

1/2 k-1 
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i(/{-/i)"(7)r /2(7) 



j=3 



Now for each i = 3,4, — 1 we have ||/i||oo ^ II /i 111 ^ follows from 

Parseval's identity that ||/(,||2 is also at most 1. Our attention turns, then, to the 
bracketed expression. Writing F{n) = A^^^{n) — ai\ one has, using Parseval's identity, 

u ~ fini)\" \f2{i)\' ^ /i((^r^-<i)<y(7)ri£(7)prf7 

X 

But, using property (ii) of regularity (Definition I5.1jl and Lemma f4.4l (viii) we have 
\\{F4/')*^2h ^ ||^y'(F* ^2)112 + ||(i^^y')* ^2 ^2)112 

= ( - <2ry^" + uFi^y") * ^2 - ^Pi^\F * ^2)ii2 

y 

^ 2e. 

It follows that IP3I ^ 2e. 

Estimation of the Qi. For each i = 2, . . . , k — 1 the quantity Qi succumbs to the estimate 

fc-i 



i-l 



^ iii?ii2-nii/;iu-ii(/;-/o 

^ IK/: -/OIL 



loo ■ Y\ ll/jll°° 

j=i+l 



Wfkh 



((^r'-<2)^2)' 

By property (ii) of regularity fPefinition 15. this is at most e. 

Combining the estimates for Pi, P2, P3 and Qi with ()6.H) completes the proof of Propo- 
sition O □ 

7. fc-TUPLES OF SETS WITH FEW ZERO-SUMS 

We retain the notation of the last two sections. We are now in a position to prove 
Theorem 11.51 Theorem II .41 and Corollarv 11.61 will be easy consequences. Let us restate 
the result for the reader's convenience. If Y[i=i -^i a cartesian product of sets in G, we 
define a zero-sum k-tuple to be a fc-tuple (xi, . . . , Xk) G HiLi -^i with Xi + ■ — h = 0. 

Theorem II . 51 Let k ^ 3 be a fixed integer, and suppose that Ai, . . . ,Ak are subsets of G 
such that Y[i=i ^^'5 o{N^~^) zero-sum k-tuples. Then we may remove o{N) elements 
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from each Ai so as to leave sets A-, such that HiLi ^^■^ '^'^ zero-sum k-tuples. 
Proof. We begin by setting up a definition analogous to Definition 11.71 

Definition 7.1. Suppose tliat Ai, ...,74^ C G and let {R,ri) be e-regular for the Ai. 
We define the reduced sets A[ to be the sets obtained by deleting, for each i = 1, . . . , k, 
all X G Ai for which x is not a regular value, or for which a^j^ ^ 4ei/'= or afg ^ 4e^/'^. 

It was a simple matter to show that obtaining the reduced graph from the original graph 
involved the deletion of rather few edges (cf. p.lj) ). To show that |A \ A'| is small is 
a little subtle. In fact it is to obtain such a result that we have been dealing with the 
functions ipr,s rather than the functions f3r,5- Up until now, either would have worked. 

Lemma 7.2. Let AO G and let ip = ip^^s for some choice ofTCG* and some 6 > 0. 
Let p > 0. Then the number of x & A for which A * iIj{x) ^ p is no more than pN. 

Proof. Write /3 = Pr,s, so that ip = [3 * [3. Let S be the set of all x G A for which 
A * ip[x) ^ p. Certainly, then, 5* * ipi^x) ^ p for all x & S. Thus we have 

= ^5(a;)5*^/>(x) ^ p\S\. 

X 

The result follows immediately. □ 

Recall Definition 17.11 Since the number of x which fail to be regular is no more than 
eA^, it follows that 

for each i. □ 

Proof of Theorem Suppose that there are at most 5N^~^ zero-summing /c-tuples 
(oi, . . . , flfe) G nf=i ^i- Choose a function e = e{6) such that e — as 5 — 0, but such 
that > iy(2^^A;^e~^). This means that there is a pair {R,ri) which is e-regular for 
A, and for which the associated constants d = \R\ and rj2 = 2~^^e^ri/ dk'^ satisfy the 
condition S'^^/ryf*^ < e. Again, this is an easy check since everything but the height of 
the power of twos is essentially irrelevant. Consider the reduced sets A[ coming from 
such a regular partition (i?, rj). As we have seen, \A[\ ^\Ai\- V^ke^l^N. We claim that 
there are no zero-sum fc-tuples in ni=i -^i- Indeed, suppose that Xi G are such that 
a^i + ■ ■ ■ + Xfc = 0. Then the counting lemma tells us that 

However every /c-tuple . . . , Wfc) contributing to the sum 

MlH hMfc=0 

leads to a zero-sum fc-tuple (xi + mi, X2 + M2, • • • , + u^) G JliLi ^i- By Lemma 14.41 
(iii) both ||'?/'i||oo and ||'?/'2||oo are at most S/r^g^- Since there are no more than 5A^^~^ 
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zero-sum fc-tuples in ni=i ^^^^ means that 

This is a contradiction, and so Y['i=i ^'^^^ indeed cannot any zero-sum fc-tuples. □ 

Proof of Theorem \1.4\ Simply apply Theorem II .51 with k = 3 and Ai = A2 = A3. □ 

Proof of Corollary [TE Apply Theorem Ol with G = Z/2NZ, k = 3. li A C [N] then 
A may be identified with a subset of Z/2A^Z by reducing modulo 2N. Set Ai = A2 = A 
and A3 = —A. The result follows by observing that if x,y,z G A then x + y = z 
(mod 2A'") if, and only ii, x + y = z. □ 

Remarks on bounds. Consider Corollary II .fil as formulated in the abstract of the paper. 
That is, if A C [A^] has 5iV^ summing triples (triples with x + y = z) then it may be 
made sum-free by removing S'N elements. Our proof gives an awful dependence between 
S and S' of the form 1/S = 1^(5'"*"). One may conjecture that a much stronger result 
should be true. It is, however, too optimistic to hope that the dependence between 
S' and S might be polynomial, and we close this section by giving a very brief sketch 
of why this is so. Let 6 > 0, let be a large positive integer and let p and q be 
distinct primes to be chosen later. Recall that by a construction of Behrend [Sj there 
is a set -B C Z/pZ with \B\ ~ exp(— Ci(logp)^/^)]9 with the Behrend property, that is 
the only triples {x, y, z) with x + y = 2z are the trivial ones for which x = y = z. 
This construction involves choosing suitable integers r and d, taking the lattice points 
on the sphere S((0, . . . , 0), r) C M*^ and then using an affine transformation to project 
to Z/pZ. Now the lattice points on a sphere clearly have the Behrend property, but 
the same is also true of the union X U 2X U AX, where X is the set of lattice points 
on the sphere S((r, 0, . . . , 0), r'), provided that r' < r/7. Using this set instead, we 
may find S C Z/pZ with 15*1 ~ exp(— C2(logp)^''^)p such that S has both the Behrend 
property and also the additional property that S H 2S H 43 has cardinahty at least 
exp(-C3(logp)i/2)p. Set T = 5 n 2S: then |T n 2T| = |5 n 25 n 4S\ satisfies the 
lower bound just mentioned. By choosing p ~ exp(— C4(log(|))^/^)/5 we may take 
|T| ~ 6p'^. Now choose q so that pq ^ N and consider the set [/ = T x Z/gZ (which 
may be considered as a subset of Z/NZ). If (ti,ai) + (^2,02) = (^3,03) then certainly 
^1+^2 = ^3- But ^3 = 2s for some s G 5* and so ti + 12 = 2s, which implies that ti = ^2- 
It follows that the number of summing triples in U is bounded above by |T|g^ ~ 6N. 
However, in order to remove all summing triples from U we must delete at least one of 
{x, a) and (2x, 2a) for all x G T fl 2T, a G Z/qZ, and to do this requires the removal of 
at least exp(— C5(log(|))^/^)A^ elements. 

I do not know a similar example in (Z/2Z)" - that is, it may be that the dependence 
between 5' and 6' in Theorem II. 41 is polynomial when G = (Z/2Z)"'. It is my belief that 
the only bounds known in the classical Proposition II. HI are also of tower type. 

8. A QUESTION OF BERGELSON, HOST AND KRA 

We again use the notation of ^andlHl In this section we prove Theorems 11.91 and II. lUI 
starting with the former. We restate it for the reader's convenience. 
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Theorem 11.91 Suppose that a, e > 0. Then there is NQ{a, e) such that if G is an 
abelian group of size N > Nq^u, e) with N odd, and if A <Z G has size aN, then there 
is some d such that A has at least {a^ — e)N three-term arithmetic progressions 
with common difference d. 

Proof. Define three sets ^1,^2,^43 by Ai := A, A2 := -2A = {-2x : x e A} and 
^3 := A. By Theorem 15.21 we may find a pah {R,r]) which is e-regular for Ai,A2 and 
^3. For a given x G G, set Xi = X3 := x and X2 '■= —2x. If Xi, X2 and X3 are ah regular 
values then the counting lemma (Proposition lfi.2j) tells us that 



C 32e. 



Since the number of x for which some Xi fails to be e-regular for A^ is at most 2eiV, it 
follows that 

T{Ar^^\'\ Ar'^2, Ar^i^l") ^ ^ <\<^2«3fi - 34eiV. (8.1) 

X X 

Now it is a fairly straightforward matter to check that 

Y,nAr'i^\'\Ar'^2,Ar'i,\'^) = j^piA-j^d), 

X d 

where P{A\ d) is the number of 3-term arithmetic progressions in A with common dif- 
ference d and the weight i>{d) is given by 

uid) := J2ijl^\y)C2i^2){y + d)i,\'\y + 2d), 
y 

Here we have written |V'2(^) = 4'2{2t). Together with (jS.lll this implies that 

^ P{A; d)v{d) ^ <i<2<i - 34eiV. (8.2) 

d X 

Now we have = = [A* tlji){x), and furthermore (since = IGI is odd) 2 — 
{A * ^iIj2){x). For notational convenience write ai{x) := = ^3^ and a2{x) := 02^. 
Now we have ipi = Pi * Pi and ^ip2 = \P2 * \P2, where Pi, P2 are the smoothed Bohr 
cutoffs used to define ipi and ip2- It follows that 

^ai(x)a2(x) = 5Z(^*'^i*'^i)(^)(^*i'^2*i/32)(x) 

X X 

= 5^(A*A*l/32)(x)^ 

X 

^ iV-i(^(A*/5i*i/52)(x))2 = a^N. 

X 

Combining this with the fact that a2{x) = aN, we obtain the inequality 

X^"i,i"2,2«3,\ = ^a?(x)a2(x) ^ (^«i(x)a2(a;))^/^a2(a;) ^ a^N. 

XX XX 

Comparing with ()8.2j) yields 

J2 P{A; d)v{d) ^ {a^ - 34e)iV, 
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and so provided > NQ{a,e) we have 

J2 Pi^; dMd) > {a^ - 35e)iV. (8.3) 

Note that if is too small then the cutoffs ipi,ip2 will be almost entirely supported at 
zero, and such a conclusion would not be correct. Now it is easy to see that z/((i) = 

T{ipy^,ilj2,ipl^'^), and so Lemma I^TTl with f = 1 gives 

^z/(d)^l + 8e. (8.4) 

d 

Thus there is some d ^ such that P{A; d) ^ (a'^ — 35e) /(l+8e), which implies Theorem 
11.91 after redefining e. □ 

We move on now to outline the proof of Theorem II. 10^ which is the same result but 
with G replaced by {1, ... , A^}. 

Proof of Theorem M.lfK Suppose that A C {1, . . . , A^} has density a. We may regard 
y4 as a set y4 C Z/A^Z in a natural way; observe, however, that 3-term arithmetic 
progressions in A need not be three-term progressions in A. To get around this problem, 
a trick is required. As before we take a pair (i?, 77) which is e-regular for A, but now 
we insist that 71/2,71 G -R, where 71 is the character x 1— > q'^-^^^/n g^^^ ^^^^ maps x 
to e^'^*^^/^, where 2 is the multiplicative inverse of 2(modA^). It is a trivial matter 
to achieve this (at the expense of an inconsequentially worse upper bound for \R\) by 
starting the iteration used to prove Theorem 15.21 with the pair (-R, r/) = ({71,71/2},!) 
instead of (0, 1). 

Now we argue exactly as in the proof of Theorem 11.9^ obtaining the bound ()8.3j] . that 
is to say 

P(A- d)v{d) ^ (a^ - 35e)Ar. (8.5) 

d 

Given d G 'L/N'L, write \d\ for the magnitude of that residue d G — {(A^ — 1)/2, . . . , (A^ — 
l)/2} with d = d{modN). We will show that almost all of the sum on the left in ()8.5p 
is concentrated on those d for which \d\ is small, which is good as such d are rather 
likely to correspond to 3-term progressions in A, rather than just (mod A^) progressions 
in A. 

Suppose then that \d\ ^ eA^. Write Ss for the set all all x such that \x\ ^ 6N. One has 
^(d) ^ ^^J/'(x)(iV^2)(a; + rf)V^l/'(x + 2d)^,(rf) 



^ ( 5^^i(x)(i^2)(x + ci)5,(d))'/'( J]^i(x + rf)(i^2)(x + 2d)Y^' 

x,d x,d 

= {YM^)C2^2){x + d)S,{d)Y^\ (8.6) 

x,d 
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Now we have S^{d) ^ 5*^/2(3;) + S^/2{x + d). We split the sum in ()8.6|) into two parts 
accordingly, that is to say as 

El ■=^^i{x){\i)2){x + d)S^/2{x) = ^^i{x)S^/2{x) 

x,d X 

and 

S2 := Y,^i{x){\i;2){x + d)S,,2{x + d) = Y,i^2{2y)S,,2{y). 

x,d y 

To estimate Si, observe that if |x| ^ eN/2 then \\x\\r ^ e/2, the notation being that of 
§ni by virtue of the fact that 71 G R. If, as we may, we assume that \R\ is much smaller 
than l/rj and that rj is enormously smaller than e it follows from ()4.2j) that Si ^ e (this 
is true by a huge margin). 

Turning to S2, note that if \y\ ^ eN/2 then ||2?/||i{ ^ e/2, this following from that fact 
that 7i/2 G R. Once again, then, we have the estimate S2 ^ e by a vast margin. 

Collating these observations together with ()8.(ij) leads to the bound 

|d|^eV 

and so in view of ()8.5j) we obtain 

P(A d)v{d) ^ (a^ - 37e)Ar. 

\d\^t 
d^O 

Together with ()8.4|) this implies that there is some d ^ 0, \d\ ^ eA^, such that P{A, d) ^ 
(a^ — 45e)A^. Now of the arithmetic progressions in A with common difference at most 
2eN do not actually correspond to genuine progressions of integers under the inverse 
of the projection map {1, . . . , N} Z/A^Z. In conclusion, then, the set A contains at 
least (a^ — AlejN three-term progressions with common difference d. □ 

9. Miscellaneous remarks 

In this section we assemble a variety of remarks concerning Theorem 15. 2| its application, 
and its relationship with results in the literature. 

/. Relationship with Szemeredi's regularity lemma. In addition to the analogies we 
have already drawn between Theorem 15. 21 and SzRL there is another, more formal, link 
between the two theorems. When applying SzRL in number theory one might consider 
a graph derived from a subset of an abelian group by something akin to the Cayley 
graph construction (cf. [HlEni)- Perhaps the simplest situation is the following. Let 
G = (Z/2Z)"', let A C G and let F be a bipartite graph on vertex set G x {0, 1}, {g, 0) 
being joined to {g', 1) precisely ii g + g' & A. If X C G we will write Xj for X x {i} 
(z = 0,l). 

For the rest of this discussion we revert to the language of §21 Suppose that if ^ G is 
e^-regular for A, this having a fairly simple meaning since G = (Z/2Z)'^. We claim that 
if Xi, X2 G G, and ii x = Xi — X2 is an e^-regular value, then the pair {H + xi, H + X2) is 
e-regular in the graph-theoretic sense of [JT] To see this, suppose that U + Xi <^ H + xi 
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and V + X2 'i^ H + X2 both have cardinahty at least e\H\. Then e{U + Xi,V + X2) is 
exactly J2u v ^"""^(^ + v)U{u)V{v). This may be estimated by Fourier techniques on H. 
Indeed 

e{U + x^,V + X2) = l/fl-i 5^ 1+^^(7)^/(7)^(7) 

\H\ 

It follows that 

\d{U + Xi,V + X2)-d{H + xi,H + X2)\ = 

This confirms the claim. Now partition both vertex classes of F into cosets H + 

Xi,. . . ,H + Xfc. For fixed i there are at most e^k values of j for which 

not regular, and so this partition is e-regular in the graph-theoretic sense of Szemeredi. 

Have we, then, simply recovered SzRL? In fact, rather more has been achieved. The 
graph F was of a special type (essentially a Cayley graph over G) but in return we were 
able to insist that the vertex classes in SzRL were not arbitrary sets, but subgroups of 
G. It is hard to formulate this principle at all precisely for groups other than (Z/2Z)"'. 
Morally speaking, however. Theorem 15.21 says that if we have a Cayley-type graph over 
a group G then the classes in SzRL may be chosen to have a rather strong structure 
which is related to G. 

II. Enumeration of sum-free and related sets. Let 6 = (61, . . . , 6^) be a fixed /c-tuple 
of non-zero integers. For any real numbers Xi, . . . , we write Lfe(a;i, . . . , Xk) = biXi + 
■ ■ ■ + bkXk- We say that a set A C [A^] is strongly L^-free if there are no solutions to 
Lh{ai, . . . , afc) = with ai E A for all i. A is deemed to be weakly L^-free if the only 
solutions to Lh{ai, . . . , a^) = are trivial, that is to say they arise by partitioning [k] 
into Ji U ■ ■ ■ U It such that Yliei = ^ for each j, and then taking all of the [i G Ij) 
to be equal. When b = (1, 1, —2), a strongly L-free set must be empty, and a weakly 
L-free set is the same thing as a Behrend set (cf. 33) • When 6 = (1, 1, — 1), the notions 
of strongly and weakly L-free coincide with that of a sum-free set. The reader is referred 
to [21] for more information on solving linear equations in sets of integers. 

In this subsection we use Theorem l5.2l to get estimates on the number of weakly L;,-free 
subsets of [N] . Many of our results would extend to arbitrary abelian groups, but the 
discussion of general linear forms is complicated by the possibility of torsion and we do 
not give it here. 

Machinery for counting sum-free sets was developed by I.Z. Ruzsa and the author in 
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a series of papers fSJ Ej. The following result may be proved by extending the 
methods used in those papers in a straightforward manner. 

Proposition 9.1 (Granularization). Let L = be a fixed linear form as above. Then 
there is a family T of subsets of \N\ with the following properties: 

(i) = 2°(^); 

(ii) If A C [N] is weakly L-free, then A C F for some F G J-"; 

(iii) Each F e has o{N''~^) solutions to . . . , /fc) = 0. 

Now Theorem 15.21 gives structural information about the sets in JF. Indeed suppose 
F e JF and set M = 2{\bi\ + ■■■+ \bk\)N. Apply Theorem Ol with G = Z/MZ and 
Ai = Fi (mod M), where Fi = biF. The choice of M guarantees that /i + ■ ■ ■ + /fe 
(mod M) if, and only if, /i + ■ ■ ■ + = 0. It follows that there is a set F' C F 
with |F \ F'l = o{N) and such that F' is strongly L-free. This leads to the following 
improvement of Proposition 19.11 

Proposition 9.2. Let L = Li, be a fixed linear form as above. Then there is a family 
T' of subsets of \N\ with the following properties: 

(i) \r\ = 2"(^); 

(ii) If AC [N] is weakly L-free, then \A \ F'\ = o{N) for some F' e T' ; 

(iii) Each F' G T' is strongly L-free. 

Note that this implies that r^{N) = o{N), since when b = (1, 1, —2) the collection JF 
must consist of just the empty set. Using Proposition 19. 21 and the methods of our papers 
with Ruzsa, one can prove the following. Write LF(A^) for the collection of all weakly 
L-free subsets of [A^]. 

Theorem 9.3. Let L = Lf, be a fixed linear form. For each N denote by /l(A^) be the 
cardinality of the largest strongly L-free subset of [N]. Then log2 |LF(A^)| = /l(A^) + 
o{N) (the o-term may depend on L). 

Proof. Consider the family JF' constructed in Proposition 19.21 For each F' G J-"', 
simply count the sets A C [A^] which satisfy 1^4 \ F'| = o{N). Since ^ /^(A^), the 
number of such sets A is no more than 2'f'^^^^~^°^^\ But all sets A G LF(A^) arise from 
some F' G J-'' in this way and so, since = 2°^^\ we do indeed have the estimate 
|LF(Ar)| = 2^^(^)+°(^). □ 

Observe that the result is best possible apart from the o-term, since LF(A^) certainly 
contains all subsets of a strongly L-free set with maximal cardinality. 

It turns out that Proposition 19.11 can also be derived from Theorem 15. 2[ though with 
much weaker quantitative information than that obtainable using the methods of jT^ 
HSllTni- This leads to a unified treatment of the enumeration of L-free sets. We sketch 
the argument here, restricting attention to sum-free sets for simplicity. 

To prove Proposition 19.11 it suffices to prove an analagous result with [A^] replaced by 
G = Z/pZ, where p G [2A^, 4A^] is a prime. Subsets of [A^] may be regarded as subsets 
of G by reducing mod p, and a set A C [A^] is sum-free if and only if A (mod p) is 
sum-free. Let e > 0. For each sum-free set A (1 G, find an e-regular pair [R, rj) for A. 
There is some p G [2e^/'^, 46^^^] such that the cardinality of the set 
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is at most Ae^^^N . Pick such a p, and consider the set A consisting of all regular values 
X such that 0:1(0;) ^ p (note that this differs from the notion of reduced set in Definition 
17.11 only in that we do not restrict ourselves to values of x which also lie in A). Take 
JF to be the collection of all these sets A. Now it is easy to see that |yl \ y4| is small 
and, using the counting lemma, that A is sum-free. It is rather less easy to see that 
1^1 = 2°("), and we only give a very brief sketch of the argument. 

In estimating |JF| up to factors of 2"'-"'' we may ignore any non-regular values of x and 
simply count sets having the form A° = {x : ai{x) ^ p}. Observe that if ai{x) ^ p+e^/^ 
then ai{x+y) ^ p for all y G -Bij,^, where k = rje 1/720. This is a consequence of Lemma 
14.41 (v). Thus A° is a union of x for which x + Bji^^ ^ A° together with at most \Sp\ extra 
points. Now by a classical argument of Dirichlet (invoking his principle of the pigeons) 
Bji f^ contains an arithmetic progression P of length at least kA^^/'^L Roughly speaking 
this means that A° may be written as a union of longish arithmetic progressions together 
with Sp. It is then easy to establish a bound ^ 2^^''^'^, where c(e) — as e — 0. 

///. Finding a single regular value: Bourgain's bound for rs{n). We owe a large debt to 
Jean Bourgain and his proof [6^ of the bound r^{n) = 0((loglogr;,/ logn)^/^), which is 
the best currently known. Bourgain's argument essentially amounts, in the language of 
the present paper, to finding a single value of x and a pair {R, rj) for which x is e-regular. 
If one is interested in such a weakening of Theorem 15.21 then substantial improvements 
can be made in the bounds. By far the most important difference between our argument 
and that of Bourgain is that he uses an i°° notion of index in place of our i"^ definition 

The fact that a large regular pair can be found was first observed in the context of 
graph regularity by Komlos (unpublished) and elaborated upon by Peng, Rodl and 
RucinskipJ^. In the arithmetic setting the argument goes through particularly cleanly 
when G = (Z/3Z)", and is then essentially the argument used by Meshulam ^Sl- This 
was based on the original argument employed by Roth [201 to prove that r3(n) = o(n), 
but for general groups G Roth's argument does not fit into the framework of regularity 
since it involves passing to substructures whose size is a small power of A^. 

IV. Higher arithmetic regularity? The discussion of III suggests that the analytic proof 
of r3(n) = o{n) |6ll20j and the proof via the regularity lemma [Hll^ are perhaps not as 
different as previously thought. There is also Gowers' analytic proof that r^ln) = o{n) 
fn\ and a proof via the Frankl-Rodl regularity lemma for 3-uniform hypergraphs jH], 
hereafter termed FRHRL. It would seem to be interesting to ask whether there is an 
"arithmetic" version of FRHRL which is analagous to FRHRL in the same way that 
Theorem 15.21 is analagous to SzRL. A proper understanding of this might be expected 
to lead to an improvement in the known bounds for r4(n). Very recently, regularity 
lemmas for fc-uniform hypergraphs have been announced independently by Gowers and 
(various combinations of) Nagle, Rodl, Schacht and Skokan. Of course, these form part 
of a more general puzzle. 

The diagram is an attempt to outline what might be hoped for. The three question 
marks refer to as yet untreated problems. The problem of finding a single regular 
structure in the hypergraph setting has probably not been investigated, maybe because 



28 



BEN GREEN 



no potential applications are known. It is quite likely, however, that such a result might 
not be too difficult to obtain, at least after reading '9] or the more recent works on 
hypergraph regularity. For that reason it is given the symbol ?* in our picture. 



? 



? 



Hyper- 
graphs 



FRHRL 




Graph Setting 



Theorem 



Arithmetic Setting 



Bourgain [H] 




Graphs 



SzRL 1211 



Koml6s,[in! 



Full regularity 



One regular object 



Since the ffist edition of this paper in October 2003, T. Tao and the author have made 
some progress on these issues. In particular we have a preprint obtaining a bound 
r4((Z/5Z)"') = 0(A^(log A^)~^), where 5" = A^. This is done, essentially, by ffiling in 
the top right corner of the above diagram for the particular group G = (Z/SZ)". As a 
result of this work we are able to guess at the correct statements for all of the question 
marks in the diagram. Proving these in full generality may be a long way off, however. 

One consequence of a suitably formulated "higher arithmetic regularity lemma" might 
be a solution to the following conjecture, which is closely related to GoroUarv 11.61 and 
the discussion of II. Let r, m be integers with r ^ m and suppose that L is an r x m 
matrix of integers with rank r. Say that a set A C [A^] is L-free if there are no vectors 
X G A"^ for which Lx = 0. 

Conjecture 9.4. Let A C [A^], and suppose that there are o(A^'"~^) vectors in A"^ for 
which Lx = 0. Then A = B U C , where B is L-free and \C\ = o{N). 

Another application of such a result might be a positive answer to Question 11.81 for 
four-term arithmetic progressions. 



10. A GOWERS TOWER FOR (Z/2Z)'' 

In this section we show that our regularity results must necessarily give terrible, tower- 
type bounds. In the context of graphs such a phenomenon was discovered by Gowers 
|10j : he constructed graphs in which the smallest e-regular partition has a number of 
parts which grows like W{e~^^^^). Our lower bound will not be quite so spectacular 
but can hardly be described as slowly-growing. Before stating it, we state and prove a 
preliminary lemma. 
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If M is a positive integer, define F{M) to equal M if M ^ 19 and [M/4J if M ^ 20. 
Define the sequence {di}i^o by do = and di+i = p{2'^i+'i^-i+-+<^o'^ for i ^ 0. Observe 
that di = 1, ci2 = 2, ^3 = 8, ^4 = 512, 4 = 2^21 and that, for z ^ 6, rf, ^ W{i - 2). 

Lemma 10.1. Let M 1 he an integer, and write V = (Z/2Z)^^'^'^\ Then there are 
M vectors ^1, . . . , G V with the property that any subset of 95 percent of them span 
V. 

Proof. If M ^ 19 this is trivial - simply take .^1, . . . ,^a/ to be any basis for V . For 
M ^ 20 we use a random approach. Choose the independently at random using the 
uniform distribution on V . Let f/ be a fixed codimension 1 subspace of V . The events 
{^i G t/} are independent Bernouilli random variables, and we may invoke a standard 
tail estimate such as (|2], Theorem A. 1.4) to deduce that 

P (at least 95 percent of the lie in U) ^ e-^''"^. 

Thus the probability that some codimension 1 subspace U contains 95 percent of the 
vectors is no more than 2^^^^^e~^^^^, which is certainly less than 1. It follows that 
there is indeed some choice of the satisfying the conclusion of the lemma. □ 

Let G = (Z/2Z)" and, as usual, write = \G\. The next theorem, which is the 
main result of this section, provides an example of a function / : G — > [0, 1] such that 
the largest subgroup H ^ G which is e-regular for / has extremely large index. The 
definition of what it means for x to be e-regular for / is the obvious one, given what was 
said in ^ A standard probablistic argument such as the one in |10|, Lemma 2 would 
produce a genuine set A with much the same properties if one was desired. 

Theorem 10.2. Let e < 1/20 and suppose that N is sufficiently large. Then there is a 
function / : G — > [0, 1] with the property that any subgroup H which is e-regular for f 
satisfies \G/H\ ^ Vr(|log2(l/e) - 5). 

Proof. Let s = [|log2(l/e) — 2j. Take a fixed nested sequence of subspaces G = Hq ^ 
Hi ^ . . . ^ Hs together with subspaces f/j+i C Hi so that Hi = iJj+i + t/j+i, where 
dimj^/2Z Ui = di (the numbers di are the ones defined at the start of the section). 
Define Vi = Ui-\- Ui^i H — ■ + Uq, so that Hi + Vi = G. For each i, construct a set Bi C G 
as follows. Set M = 2'^«+'^'-i+-+'^o = \Vi\ in Lemma HUHl and take vectors C.v e f/j+i 
indexed by f G V^, with the property that any (1 — e)\Vi\ of them span f/j+i. For each 
V, let ^° = {u e Ui+i : {u,Q = 0}. Writing G = H,+i + Ui+i + Vi, let Bi be the set 
defined by 

Bi = [j {H,^, + C^ + v). 

Bi has cardinality A^/2, and consists of a codimension one subspace in each coset of Hi. 
Define 



/ = \[b, + \b, + ... + ^h 



We will prove, by induction on z, that if H is e-regular for / then H C H^. Suppose 
we know that H C Hi. Take g & G and write it as y + f , where y E Hi and v & Vi. 
Consider the set {Bi)~^^. We have 

(B)^'(x) = I ^ iixeHn{H,+i + ^° + y) 
y i)H y ) ^ Q otherwise. 



30 



BEN GREEN 



Now ^° + i/j+i + y is a. hyperplane (coset of a codimension 1 subspace) in Hi which 
is perpendicular to Therefore its intersection with H is either empty, all of H or 
a hyperplane in H. The latter case arises when H is not a subspace of -ffj+i + 

Supposing this is the case, then |(_Bj)^^(^„)| = \H\/2. Furthermore ii j < i then Bj is, 
by construction, a union of iJj-cosets. Since H C Hi it follows that {Bj)~^^ is either 

empty or else all of H, and so (Bj)^^(^^) = 0. When j > i we have the trivial bound 

\(b~)^^{^^)\ ^ \H\. It follows that ifH ^ Hi+i + e° then 

As -ff is assumed to be e-regular for / this can hold for no more than eA^ values of 
g E G. This means that the proportion of f G for which H ^ -ffj+i + ^° is at most 
e. Let Q be the remaining values of v, of which there are at least (1 — e)|Vj|. By the 
choice of the vectors the collection {C,v}ven spans f/j. Thus H, which lies in 

must in fact be contained in -ffj+i. 

We have completed the inductive step. It follows that if H is e-regular for / then 
H C Hs, which leads to the lower bound on \G/H\ stated in the theorem. □ 
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